首页> 外文OA文献 >Engineering Parallel Algorithms for Community Detection in Massive Networks
【2h】

Engineering Parallel Algorithms for Community Detection in Massive Networks

机译:大规模社区检测的工程并行算法   网络

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The amount of graph-structured data has recently experienced an enormousgrowth in many applications. To transform such data into useful information,fast analytics algorithms and software tools are necessary. One common graphanalytics kernel is disjoint community detection (or graph clustering). Despiteextensive research on heuristic solvers for this task, only few parallel codesexist, although parallelism will be necessary to scale to the data volume ofreal-world applications. We address the deficit in computing capability by aflexible and extensible community detection framework with shared-memoryparallelism. Within this framework we design and implement efficient parallelcommunity detection heuristics: A parallel label propagation scheme; the firstlarge-scale parallelization of the well-known Louvain method, as well as anextension of the method adding refinement; and an ensemble scheme combining theabove. In extensive experiments driven by the algorithm engineering paradigm,we identify the most successful parameters and combinations of thesealgorithms. We also compare our implementations with state-of-the-artcompetitors. The processing rate of our fastest algorithm often reaches 50Medges/second. We recommend the parallel Louvain method and our variant withrefinement as both qualitatively strong and fast. Our methods are suitable formassive data sets with billions of edges.
机译:最近,在许多应用程序中,图结构化数据量经历了巨大的增长。为了将此类数据转换为有用的信息,必须使用快速分析算法和软件工具。一种常见的graphanalytics内核是不相交的社区检测(或图聚类)。尽管针对此任务对启发式求解器进行了广泛的研究,但只有很少的并行代码存在,尽管要扩展到实际应用程序的数据量,并行性是必需的。我们通过具有共享内存并行性的灵活和可扩展的社区检测框架来解决计算能力的不足。在此框架内,我们设计并实现了有效的并行社区检测启发式算法:众所周知的Louvain方法的首次大规模并行化,以及对该方法的扩展以增加精炼;以及将上述方法结合在一起的整体方案。在由算法工程范式驱动的大量实验中,我们确定了最成功的参数以及这些算法的组合。我们还将实现与最先进的竞争对手进行比较。我们最快的算法的处理速率通常达到50Medges /秒。我们建议使用并行的Louvain方法以及经过改进的改进方法,因为它在质量上既强大又快速。我们的方法适用于具有数十亿条边的海量数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号